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We present the design, implementation, and evaluation of D-GRAID, a gracefully 
degrading and quickly recovering RAID storage array. D-GRAID ensures that most files 
within the file system remain available even when an unexpectedly high number of faults 
occur. D-GRAID achieves high availability through aggressive replication of semantically 
critical data, and fault-isolated placement of logically related data. D-GRAID also recovers 
from failures quickly, restoring only live file system data to a h ... 

Keywords: Block-based storage, Disk array, RAID, fault isolation, file systems, smart 
disks 



4 Manageability, availability, and performance in porcupine: a highly scalable, cluster- Q 
||> based mail service 

^ Yasushi Saito, Brian N. Bershad, Henry M. Levy 

August 2000 ACM Transactions on Computer Systems (TOCS), Volume 18 issue 3 

Publisher: ACM Press 

Full text available:^] pdf(2.52 MB ) Additional Information: full citation, abstract, references , index terms 

This paper describes the motivation, design and performance of Porcupine, a scalable mail 
server. The goal of Porcupine is to provide a highly available and scalable electronic mail 
service using a large cluster of commodity PCs. We designed Porcupine to be easy to 
manage by emphasizing dynamic load balancing, automatic configuration, and graceful 
degradation in the presence of failures. Key to the system's manageability, availability, 
and performance is that sessions, data, and underlying ... 

Keywords: cluster, distributed systems, email, group membership protocol, load 
balancing, replication 
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December 1993 ACM SIGOPS Operating Systems Review , Proceedings of the 

fourteenth ACM symposium on Operating systems principles SOSP 

'93, Volume 27 Issue 5 
Publisher: ACM Press 

Full text available: jf| pdf(1.53 MB) Additional Information: full citation, abstract, references ., citings, index 
^ terms 

Recoverable virtual memory refers to regions of a virtual address space on which 
transactional guarantees are offered. This paper describes RVM, an efficient, portable, and 
easily used implementation of recoverable virtual memory for Unix environments. A 
unique characteristic of RVM is that it allows independent control over the transactional 
properties of atomicity, permanence, and serializability. This leads to considerable 
flexibility in the use of RVM, potentially enlarging the ... 
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Recoverable virtual memoryrefers to regions of a virtual address space on which 
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transactional guarantees are offered. This article describes RVM, an efficient, portable, 
and easily used implementation of recoverable virtual memory for Unix environments. A 
unique characteristic of RVM is that it allows independent control over the transactional 
properties of atomicity, permanence, and serializability. This leads to considerable 
flexibility in the use of RVM, potentially enla ... 

Keywords: Camelot, Coda, RVM, Unix, logging, paging, persistence, scalability, 
throughput, truncation 



7 The monitoring and early detection of internet worms 
Cliff C. Zou, Weibo Gong, Don Towsley, Lixin Gao 

October 2005 IEEE/ACM Transactions on Networking (TON), Volume 13 issue 5 
Publisher: IEEE Press 

Full text available:^ pdf(594. 79 KB) Additional Information: full citation , abstract , references , index terms 

After many Internet-scale worm incidents in recent years, it is clear that a simple self- 
propagating worm can quickly spread across the Internet and cause severe damage to 
our society. Facing this great security threat, we need to build an early detection system 
that can detect the presence of a worm in the Internet as quickly as possible in order to 
give people accurate early warning information and possible reaction time for 
counteractions. This paper first presents an Internet worm monitoring ... 

Keywords: computer network security, early detection, internet worm, network 
monitoring 
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8 Hibernator: help ing disk arra ys sleep throu gh the winter Q 
Qingbo Zhu, Zhifeng Chen, Lin Tan, Yuanyuan Zhou, Kimberly Keeton, John Wilkes 
October 2005 ACM SIGOPS Operating Systems Review , Proceedings of the twentieth 
ACM symposium on Operating systems principles SOSP '05, volume 39 issue 
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Publisher: ACM Press 

Full text available: ^ pdf(654.56 KB) Additional Information: full citation , abstract , references , index terms 

Energy consumption has become an important issue in high-end data centers, and disk 
arrays are one of the largest energy consumers within them. Although several attempts 
have been made to improve disk array energy management, the existing solutions either 
provide little energy savings or significantly degrade performance for data center 
workloads. Our solution, Hibernaton, is a disk array energy management system that 
provides improved energy savings while meeting performance goals. Hibernator co ... 

Keywords: disk array, disk layout, energy management, performance guarantee, storage 
system 
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international conference on Management of data SIGMOD '87, volume 

16 Issue 3 
Publisher: ACM Press 
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Increased interest in using workstations and small processors for distributed transaction 
processing raises the question of how to implement the logs needed for transaction 
recovery. Although logs can be implemented with data written to duplexed disks on each 
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processing node, this paper argues there are advantages if log data is written to multiple 
log server nodes. A simple analysis of expected logging loads leads to the conclusion that 
a high performance, microprocessor b ... 

10 Gossip-based aggregation in large dynamic networks 
Mark Jelasity, Alberto Montresor, Ozalp Babaoglu 

August 2005 ACM Transactions on Computer Systems (TOCS), Volume 23 issue 3 
Publisher: ACM Press 

Full text available: pdf(533.89 KB) Additional Information: full citation , abstract , references , index terms 

As computer networks increase in size, become more heterogeneous and span greater 
geographic distances, applications must be designed to cope with the very large scale, 
poor reliability, and often, with the extreme dynamism of the underlying network. 
Aggregation is a key functional building block for such applications: it refers to a set of 
functions that provide components of a distributed system access to global information 
including network size, average load, average uptime, location ... 

Keywords: Gossip-based protocols, proactive aggregation 



11 The Zebra striped network file system 
John H. Hartman, John K. Ousterhout 

August 1995 ACM Transactions on Computer Systems (TOCS), Volume 13 issue 3 
Publisher: ACM Press 

Full text available* H9 pdf(2 76 MB) Additional Information: full citation , abstract, references, citings, index 

terms , review 

Zebra is a network file system that increases throughput by striping the file data across 
multiple servers. Rather than striping each file separately, Zebra forms all the new data 
from each client into a single stream, which it then stripes using an approach similar to a 
log -structured file system. This provides high performance for writes of small files as well 
as for reads and writes of large files. Zebra also writes parity information in each stripe in 
the style of RAID disk arrays; this ... 

Keywords: RAID, log-based striping, log -structured file system, parity computation 



12 The TickerTAlP parallel RAID architecture 

Pei Cao, Swee Boon Lin, Shivakumar Venkataraman, John Wilkes 
August 1994 ACM Transactions on Computer Systems (TOCS), volume 12 issue 3 
Publisher: ACM Press 

Full text available: fgl pdf(2.04 MB) Additional Information: full citation , abstract , references , citings, index 

terms 

Traditional disk arrays have a centralized architecture, with a single controller through 
which all requests flow. Such a controller is a single point of failure, and its performance 
limits the maximum number of disks to which the array can scale. We describe 
TickerTAlP, a parallel architecture for disk arrays that distributes the controller functions 
across several loosely coupled processors. The result is better scalability, fault tolerance, 
and flexibility.This article present ... 

Keywords: RAID disk array, decentralized parity calculation, disk scheduling, distributed 
controller, fault tolerance, parallel controller, performance simulation 
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May 1993 ACM SIGARCH Computer Architecture News , Proceedings of the 20th 

annual international symposium on Computer architecture ISCA '93, volume 
21 Issue 2 
Publisher: ACM Press 

Full text available' 151 pdfd 19 MB) Additional Information: full citation , abstract , references , citings , index 
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Traditional disk arrays have a centralized architecture, with a single controller through 
which all requests flow. Such a controller is a single point of failure, and its performance 
limits the maximum size that the array can grow to. We describe here TickerTAlP, a 
parallel architecture for disk arrays that distributed the controller functions across several 
loosely-coupled processors. The result is better scalability, fault tolerance, and flexibility. 
This paper presents the Tic ... 

15 Implementing sequentially consistent shared objects using broadcast and point-to- Q 
^ point communication 

Alan Fekete, M. Frans Kaashoek, Nancy Lynch 

January 1998 Journal of the ACM (J ACM), volume 45 issue l 

Publisher: ACM Press 

Full text available* odf(257 13 KB) Additional Information: full citation , abstract , references , citings, index 

' terms 

This paper presents and proves correct a distributed algorithm that implements a 
sequentially consistent collection of shared read/update objects. This algorithm is a 
generalization of one used in the Orca shared object system. The algorithm caches 
objects in the local memory of processors according to application needs; each read 
operation accesses a single copy of the object, while each update accesses all copies. The 
algorithm uses broadcast communication when it sends messages to replic ... 

Keywords: Orca programming language, distributed shared memory, formal methods, 
input/output automata, ordered multicast, replicated data 

16 Tolerating multiple failures in RAID architectures with optimal storage and uniform Q 
^ declustering 

^ Guillermo A. Alvarez, Walter A. Burkhard, Flaviu Cristian 

May 1997 ACM SIGARCH Computer Architecture News , Proceedings of the 24th 

annual international symposium on Computer architecture ISCA '97, volume 

25 Issue 2 
Publisher: ACM Press 

Full text available- ■f Bpdffl.SOMB) Additional Information: full citation , abstract, references , citings, index 

terms 

We present DATUM, a novel method for tolerating multiple disk failures in disk arrays. 
DATUM is the first known method that can mask any given number of failures, requires an 
optimal amount of redundant storage space, and spreads reconstruction accesses 
uniformly over disks in the presence of failures without needing large layout tables in 
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controller memory. Our approach is based on information dispersal, a coding technique 
that admits an efficient hardware implementation. As t ... 

17 The Zebra striped network file system 
John H. Hartman, John K. Ousterhout 

December 1993 ACM SIGOPS Operating Systems Review , Proceedings of the 

fourteenth ACM symposium on Operating systems principles SOSP 

'93, Volume 27 Issue 5 
Publisher: ACM Press 

Full text available- ^|pdf(1 93 MB) Additional Information: full citation , abstract , references , citings , index 

Zebra is a network file system that increases throughput by striping file data across 
multiple servers. Rather than striping each file separately, Zebra forms all the new data 
from each client into a single stream, which it then stripes using an approach similar to a 
log-structured file system. This provides high performance for writes of small files as well 
as for reads and writes of large files. Zebra also writes parity information in each stripe in 
the style of RAID disk arrays; this increase ... 

™ A 50-Gb/s IP router 

Craig Partridge, Philip P. Carvey, Ed Burgess, Isidro Castineyra, Tom Clarke, Use Graham, 
Michael Hathaway, Phil Herman, Allen King, Steve Kohalmi, Tracy Ma, John Mcallen, Trevor 
Mendez, Walter C. Milliken, Ronald Pettyjohn, John Rokosz, Joshua Seeger, Michael Sollins, 
Steve Storch, Benjamin Tober, Gregory D. Troxel 

June 1998 IEEE/ ACM Transactions on Networking (TON), volume 6 issue 3 
Publisher: IEEE Press 

Full text available- ffi pdfd 33.28 KB) Additional Information: full citation , references , citings, index terms, 

review 
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^ Alan Burns, Brian Dobbing, Tullio Vardanega 
V June 2004 ACM SIGAda Ada Letters, volume xxiv issue 2 
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Full text available: pdf(548.17 KB) Additional Information: full citation , references 



20 Routin g : Ensurin g cache freshness in on-demand ad hoc network routing protocols Q 

Yih-Chun Hu, David B. Johnson 
N/ October 2002 Proceedings of the second ACM international workshop on Principles of 
mobile computing 

Publisher: ACM Press 

Full text available* pdfd 31 62 KB) Additional Information: full citation , abstract , references , citings , index 
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In a wireless ad hoc network, nodes cooperate to forward packets for each other over 
possibly multi-hop paths, to allow nodes not within direct wireless transmission range to 
communicate. Many routing protocols have been proposed for the ad hoc network 
environment, several of which operate on-demand and utilize a route cache listing links 
that this node has learned. In such protocols, aggressive caching of overheard routes can 
significantly improve performance; in particular, overhead can ... 
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Hierarchical disk cache management in RAID 5 controller 
Jung-ho Huh, Tae-mu Chang 

December 2003 Journal of Computing Sciences in Colleges, volume 19 issue 2 
Publisher: Consortium for Computing Sciences in Colleges 

Full text available: Q pdf(137.71 KB ) Additional Information: full citation , abstract , references , index terms 

In RAID system, disk cache is one of the important elements in improving the system 
performance. Two-level cache displays superior performance in comparison to single 
cache and is effective in temporal and spatial locality. The proposed cache system 
consists in two levels. The first level cache is a set associative cache with small block size 
whereas the second level cache is a fully associative cache with large block size. In this 
paper, a RAID 5 disk cache model is presented that is located on ... 



2 Data partitioning and load balancin g in parallel disk systems 
Peter Scheuermann, Gerhard Weikum, Peter Zabback 

February 1998 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 7 Issue 1 
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^ jpdf(31Q.27 KB) Additional Information: full citation , abstract , citings , index terms 

Parallel disk systems provide opportunities for exploiting I/O parallelism in two possible 
ways, namely via inter-request and intra-request parallelism. In this paper, we discuss 
the main issues in performance tuning of such systems, namely striping and load 
balancing, and show their relationship to response time and throughput. We outline the 
main components of an intelligent, self-reliant file system that aims to optimize striping 
by taking into account the requirements of the applications, an ... 

Keywords: Data allocation, Disk cooling, File striping, Load balancing, Parallel disk 
systems, Performance tuning 
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This paper addresses the selection of technologies that provide each student group a 
dedicated environment on a non-dedicated host machine. The authors investigated 
different combinations of enabling technologies and approaches, such as virtual machine 
technology, storage technology, and host operating system. Performance tests were 
developed and executed to profile the performance of the technologies. The results of this 
work provide an evaluation of the studied technologies and a selection gui ... 

Keywords: VMware, course development, curriculum, end-user computing, innovative 
lab strategies in IT, interesting applications in IT, networking, operating systems, systems 
software 



4 False sharing problems in cluster-based disk arrays Q 
Hai Jin, Kai Hwang 

February 1999 Proceedings of the 1999 ACM symposium on Applied computing 
Publisher: ACM Press 

Full text available: Qpdf(618.92 KB) Additional Information: full citation , references , index terms 




Keywords: RAID, clusters of workstations, false sharing, storage system architecture 



The HP AutoRAID hierarchical storage system 
John Wilkes, Richard Golding, Carl Staelin, Tim Sullivan 

February 1996 ACM Transactions on Computer Systems (TOCS), Volume 14 issue l 
Publisher: ACM Press 

Full text available: fg] pdf(1.82 MB) Additional Information: full citat ion, abstract, references, citings, index 

terms 

Configuring redundant disk arrays is a black art. To configure an array properly, a system 
administrator must understand the details of both the array and the workload it will 
support. Incorrect understanding of either, or changes in the workload over time, can lead 
to poor performance. We present a solution to this problem: a two-level storage hierarchy 
implemented inside a single disk-array controller. In the upper level of this hierarchy, two 
copies of active data are stored to provide f ... 

Keywords: RAID, disk array, storage hierarchy 
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Arpaci-Dusseau 

March 2004 ACM SIGARCH Computer Architecture News , Proceedings of the 31st 
annual international symposium on Computer architecture ISCA '04, 

Volume 32 Issue 2 
Publisher: IEEE Computer Society, ACM Press 

Full text available: Q pdf(250.59 KB) Additional Information: fuil citation , abstract , citings 

RAID storage arrays often possess gigabytes of RAM forcaching disk blocks. Currently, 
most RAID systems use LRUor LRU-like policies to manage these caches. Since these 
arraycaches do not recognize the presence of file system buffer caches,they redundantly 
retain many of the same blocks as those cachedby the file system, thereby wasting 
precious cache space. In thispaper, we introduce X-RAY, an exclusive RAID array 
cachingmechanism. X-RAY achieves a high degree of (but not perfect) exclusivitythr ... 
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Volume 24 Issue 10 
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Full text available- fiB pdf(1.19MB) Additional Information: full citation , abstract , references , citings, index 
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The Choices operating system architecture [3, 4, 15] uses class hierarchies and object- 
oriented programming to facilitate the construction of customized operating systems for 
shared memory and networked multiprocessors. The software is being used in the 
Tapestry Parallel Computing Laboratory at the University of Illinois to study the 
performance of algorithms, mechanisms, and policies for parallel systems. This paper 
describes the architectural design and class hierarchy of ... 
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September 2001 Journal on Educational Resources in Computing (JERIC) 
Publisher: ACM Press 
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August 1985 ACM Transactions on Computer Systems (TOCS), Volume 3 issue 3 
Publisher: ACM Press 

Full text available - fsS odf(3 13 MB) Additional Information: full citation, abstract, references , citings, index 

terms, review 

The current trend of computer system technology is toward CPUs with rapidly increasing 
processing power and toward disk drives of rapidly increasing density, but with disk 
performance increasing very slowly if at all. The implication of these trends is that at 
some point the processing power of computer systems will be limited by the throughput of 
the input/output (I/O) system. A solution to this problem, which is described and 
evaluated in this paper, is disk cache 
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Zebra is a network file system that increases throughput by striping the file data across 
multiple servers. Rather than striping each file separately, Zebra forms all the new data 
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from each client into a single stream, which it then stripes using an approach similar to a 
log -structured file system. This provides high performance for writes of small files as well 
as for reads and writes of large files. Zebra also writes parity information in each stripe in 
the style of RAID disk arrays; this ... 

Keywords: RAID, log-based striping, log-structured file system, parity computation 



12 The automatic improvement of locality in storage systems 
Windsor W. Hsu, Alan Jay Smith, Honesty C. Young 

November 2005 ACM Transactions on Computer Systems (TOCS), Volume 23 issue 4 
Publisher: ACM Press 

Full text available: ^]pdf( 2.58 MB ) Additional Information: full citation , abstract , references , index terms 

Disk I/O is increasingly the performance bottleneck in computer systems despite rapidly 
increasing disk data transfer rates. In this article, we propose Automatic Locality- 
Improving Storage (ALIS), an introspective storage system that automatically reorganizes 
selected disk blocks based on the dynamic reference stream to increase effective storage 
performance. ALIS is based on the observations that sequential data fetch is far more 
efficient than random access, that improving seek distances prod ... 

Keywords: Data layout optimization, block layout, data reorganization, data 
restructuring, defragmentation, disk technology trends, locality improvement, prefetching 
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Witoid Litwin, Rim Moussa, Thomas Schwarz 

September 2005 ACM Transactions on Database Systems (TODS), Volume 30 issue 3 
Publisher: ACM Press 

Full text available:^ pdf(774. 32 KB) Additional Information: full citation , abstract , references , index terms 

LH&ast; RS is a high-availability scalable distributed data structure (SDDS). An LH&ast;RS 
file is hash partitioned over the distributed RAM of a multicomputer, for example, a 
network of PCs, and supports the unavailability of any k &geq; 1 of its server nodes. The 
value of k transparently grows with the file to offset the reliability decline. Only the 
number of the storage nodes potentially limits the file growth. The high-availability 
management uses a novel ... 

Keywords: P2P, Scalable distributed data structure, grid computing, high-availability, 
linear hashing, physical database design 
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Current operating systems offer poor performance when a numeric application's working 
set does not fit in main memory. As a result, programmers who wish to solve "out-of- 
core" problems efficiently are typically faced with the onerous task of rewriting an 
application to use explicit I/O operations (e.g., read/write). In this paper, we propose and 
evaluate a fully automatic technique which liberates the programmer from this task, 
provides high performance, and requires only minima ... 
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Publisher: ACM Press 
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Data sets in large applications are often too massive to fit completely inside the 
computers internal memory. The resulting input/output communication (or I/O) between 
fast internal memory and slower external memory (such as disks) can be a major 
performance bottleneck. In this article we survey the state of the art in the design and 
analysis of external memory (or EM) algorithms and data structures, where the goal is to 
exploit locality in order to reduce the I/O costs. We consider a varie ... 

Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external 
memory, hierarchical memory, multidimensional access methods, multilevel memory, 
online, out-of-core, secondary storage, sorting 
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As improvements in processor performance continue to far outpace improvements in 
storage performance, I/O is increasingly the bottleneck' in computer systems, especially in 
large database systems that manage huge amoungs of data. The key to achieving good 
I/O performance is to thoroughly understand its characteristics. In this article we present 
a comprehensive analysis of the logical I/O reference behavior of the peak 
productiondatabase workloads from ten of the world's largest corporatio ... 

Keywords: I/O, TPC benchmarks, caching, locality, prefetching, production database 
workloads, reference behavior, sequentiality, workload characterization 
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First 50 documents 



An Age-Threshold Algorithm for Garbage Collection in.. - Menon, Stockmeyer (1998) (1 citation) (Correct) 

....the file system. Because LSA is implemented in an outboard controller which has no understanding of files, it is 
more appropriate to think of LSA as a log structured track manager rather than a log structured file system. In this 
respect, LSA has some similarities to Loge [4] and to Logical Disk [3], both of which are implemented 
below the file system. In LSA, updated data is written into new disk locations instead of being written in place. 
Large amounts of updated data are collected in controller memory and written together to a contiguous segment 
on the disks. Parity on this data is also .... 

W. de Jonge, M. F. Kaashoek, and W. C. Hsieh, The logical disk: a new approach to improving file systems, Proc. 
14th ACM Symposium on Operating System Principles, Dec. 1993, Asheville, NC, pp. 15-28. 



FT-NFS: an Efficient Fault Tolerant NFS Server Designed for.. - Peyrouze, Muller (1996) (2 citations) (Correct) 

....within the container. It is therefore impossible to use directly the Unix names of the files. This led us to design 
a second file system at the user INRIA FT NFS: an Efficient Fault Tolerant NFS Server for Off the shelf 
Workstations 11 level and to separate file management from disk management [21], the latter being 
provided by Unix. Instead of being internally denoted by an inode, a file in FT NFS is named with an ID. A first 
consequence of this choice is that file attributes are kept contiguously with the file data to allow the 
implementation of hardware links. This has the important .... 

W. de Jonge, M.F. Kaashoek, and W.C. Hsieh. The logical disk: A new approach to improving file systems. In 
Proc. of 14th ACM Symposium on Operating Systems Principles, December 1993. 



Simplifying Distributed File Systems Using a Shared Logical Disk - Shillner, Felten (1996) (6 citations) (Correct) 



....view to users. The result is a major simplification of the file system implementation. Our approach is based on 
picking some ideas from the work of others, and combining them with some new ideas, particularly in the area of 
fault tolerance. From de Jonge et al. we take the logical disk concept [8], which we extend to the 
distributed case. From Attanasio et al. we take the idea of striping a logical device across distributed physical 
devices [2] which we extend by adding caching and cache coherence. From Mann et al. we take the idea of 
expressing consistency requirements as ordering .... 

....extend by applying it to cache to cache transfers. The result of combining these ideas, and adding some 
support for fault tolerance, is a base layer that makes it easy to write non centralized distributed file systems. 1 . 1 
Logical disk The shared logical disk is based on concept of logical disk [8]. A logical disk is an array of fixed 
size data blocks, which processes may use to stably store persistent data. Processes refer to the data blocks via 
logical block numbers; the logical disk maps these block numbers onto physical disk addresses. The logical disk 
is free to store the blocks at any .... 

[Article contains additional citation context not shown here] 

Wiedbren de Jonge, M. Frans Kaashoek, and Wilson C. Hsieh. The logical disk: A new approach to improving file 
systems. In Proceedings of the 14th Symposium on Operating System Principles, pages 15-28, 1993. 
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Symphony: An Integrated Multimedia File System - Shenoy, Goval. Rao. Vin (1997) (27 citations) (Correct ) 

....the storage space is statically partitioned and the disk bandwidth is dynamically shared by logical volumes. In 
contrast, Symphony is a physically integrated file system, in which both storage space and disk bandwidth are 
dynamically shared among data types. Finally, the logical disk abstraction [15] provides an interface that 
allows multiple file systems to coexist on a single storage device. Logical disks provide functionalities similar 
to those provided by the data type independent layer of Symphony, such as multiple block sizes, location hints, 
etc. However, a key difference between .... 

W. Jonge, M, F. Kaashoek, and W. C. Hsieh. The Logical Disk: A New Approach to Improving File Systems. In 
Proceedings of the Fourteenth Symposium on Operating Systems Principles, 1993. 



A Comparison of Two Distributed Disk Systems - Lee. Thekkath. Whitaker. Hogg (1991) (1 citation) (Correct) 

....sites because they only require access to a general purpose network and, by necessity, are designed to handle 
intermittent communication failures between nodes. 3. 2 Address Mapping Storage systems that manage 
multiple disks typically support virtual disk abstractions to simplify management [6, 19]. A virtual disk can 
span multiple physical disks for performance, mirror data for reliability, and can implement a variety of 
administrative policies. When clients access a virtual disk, the virtual disk addresses must be mapped to the 
corresponding physical disk addresses. Most storage systems .... 

Wiebren de Jonge, M. Frans Kaashoek, and Wilson C. Hsieh. The logical disk: A new approach to improving file 
systems. In Proceedings of the ACM Symposium on Operating Systems Principles, pages 15-28, 1989. 



A Comparison of Two Distributed Disk Systems - Lee, Thekkath, Whitaker, Hogg (1991) (1 citation) (Correct ) 

....and users become more sophisticated and increase beyond what can easily be satisfied by a few disk array 
controllers, managing the many disparate components of the storage system becomes a severe problem. There 
has been considerable research devoted to designing large scale storage systems [1 3, 5, 8, 10, 14, 18, 
19]. A distributed disk system is a class of storage system that can reduce the complexity of building and 
managing large scale storage systems. Distributed disk systems manage collections of disks shared by, or 
partitioned across, multiple nodes as a single logical storage system that is highly .... 

Wiebren de Jonge, M. Frans Kaashoek, and Wilson C. Hsieh. The logical disk: A new approach to improving file 
systems. In Proceedings of the 14th ACM Symposium on Operating Systems Principles, pages 15-28, December 
1989. 



A Case for Compositional File Systems - Bordawekar (Correct) 

.... languages: Modula 3 or Java or C 6 Related Work The basic design approach uses the end to end 
argument [SRC84] The end to end argument has been used for designing operating and file systems 
[LS79, EKJ95] Other key projects that have influenced this design are: Exokernel Virtual disk [dJKH93] 
NASD file systems [GNA 97] xFS [ADN 96] Frangipani [TML97] HFS [Kri94] Forum [GSSW95] and 
Windows NTFS [Nag97, RvlG97] 7 Summary Traditional file systems suffer from the following 
disadvantages: application specific design, monolithic functional structure, and client server .... 

W. de Jonge, M. Frans Kaashoek, and W. C. Hsieh. The Logical Disk: A New Approach to Improving File 
Systems. In Proceedings of the Fourteenth Symposium on Operating Systems Principles, December 1993. 



MiSFIT: A Freely Available Tool for Building Safe Extensible Systems - Small ( Correct ) 

....each test representing a class of possible OS extensions. We include a short description of each test; for more 
detail, the reader is directed to the earlier paper, hotlist: choose which page to evict from a linked list of page 
descriptors, lid: simulate the operation of a logical disk layer [DeJon93]. 6 . md5: compute the MD5 
checksum [RFC1321] of 1MB of data. The tests were run on a 120MHz Pentium with 64MB of EDO memory, 
running BSD OS 2.1. Each test and its data easily fits into main memory. We report times relative to the 
unprotected version of the code. The results are found in Table .... 

de Jonge, W., Kaashoek, M. F„ Hsieh, W., "The Logical Disk: A New Approach to Improving File Systems" 
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Proceedings of the 14th SOSP, 15-28, Asheville, NC (December 1993). 



xFS: A Wide Area Mass Storage File System - Wang (1993) (16 citations) (Correct) 

....table entry is of the following form: block ID # of blocks device address As data migrate among different storage 
levels, we simply change the corresponding translation tables, a simpler and cleaner approach than extending the 
existing UFS data structures. This is similar to the approach taken by [7] where logical disk addresses are 
mapped to physical ones to allow a clean separation between file and disk management without 
sacrificing performance. The memory management analogy, unfortunately, does not apply for data layout. 
Firstly, unlike processor caches which are usually direct mapped, we .... 

W. de Jonge, M. F. Kaashoek, and W. Hsieh. The Logical Disk: A New Approach to Improving File System 
Performance. Proceedings of the 14th Symposium on Operating Systems Principles, December 1993. To appear. 



The Swarm Scalable Storage System - John Hartman (1999) (8 citations) (Correct) 

.. .as seen by a file system, data structures in the log. This is similar to running the LFS cleaner at user level, 
and enjoys the same advantages [15] Other examples of possible services are an atomic recovery unit 
(ARU) 6] service that provides atomicity across multiple log writes; a logical disk [4] service that provides 
a disk abstraction that hides the append only log, allowing higher level services and applications to 
overwrite the blocks they store; a caching service that caches log data in main memory; an encryption 
service; a compression service; etc. Distributed services, such as .... 

W. De Jonge, M. F. Kaashoek, and W. C. Hsieh. The Logical Disk: a new approach to improving file systems. 
Operating Systems Review, 27(5): 15-28, Dec. 1993. 



Rodney Van Meter - Information Sciences (Correct) 

....much like an extent based file system. Our work benefits from this work. Other possible file system structures, 
such as SGI s XFS [12] may depend on more dynamic, and hence complex, data structures, and may therefore 
not allocate blocks as predictably. A log based file system [9] or disk device [3] clearly will not, in their 
present forms, allocate blocks in a fashion amenable to improving throughput by careful choice of 
blocks. 6.2 Impact on File System Allocation Policies As proposed by Ghandeharizadeh [4] the idea of including 
a measure of ZCAV effects into a dynamic file relocater is .... 

W. de Jonge, M. F. Kaashoek, and W. C. Hsieh. The logical disk: A new approach to improving file systems. In 
Proc. Fourteenth ACM Symposium on Operating Systems Principles, pages 15-28, Dec. 1993. 



Elephant: The File System that Never Forgets - Douglas Santry (1999) (8 citations) (Correct) 

. .. a set of new utilities that exploit Elephant s novel functionality (we have already written a few utilities 
including tgrep , tls , and a history browser) Third, we are examining an alternate implementation that 
provides versioning at the level of blocks and abstracted by a logical disk [1]. Fourth, we are investigating 
how to backup an Elephant file system so that version histories can be recovered following a media failure. 
Finally, we are planning an extensive user study. To facilitate this study, we are modifying our prototype to allow 
it to shadow an NFS server. Users will thus .... 

W. de Jonge, M. Kaashoek, and W. C. Hsieh. The logical disk: A new approach to improving file systems. In 
Proceedings of the 14th Symposium on Operating Systems Princi pies, pages 15-28, December 1993. 



O pportunistic Log: Efficient Reads in a Reliable Object Server - OToole. Shrira (1994) (1 citation) (Correct) 

....on the basis of disk head position, we can expect to dramatically reduce the expected cost of an installation 
read. There is substantial previous work on delayed processing of disk write operations [18, 10] Some 
methods applied to delayed disk writes involve writing pages at new locations [4, 7, 15] and would not 
work with disk reads. However, standard disk scheduling methods based on head position apply equally well to 
read operations. In particular, Seltzer et al. 18] have shown that when a pool of 1000 operations is available, 
greedy algorithms can reduce the cost of individual .... 



http://citeseer.ist.psu.edU/context/135725/0 



4/17/06 



, Citations: The Logical Disk: A New Approach to Improving File Systems - de Jonge, Kaa... Page 4 of 6 



W. de Jonge, F. Kaashoek, and W. Hsieh. The logical disk: A new approach to improving file systems. In Proc. of 
the 14th Symposium on Operating Systems Principles, Asheville, NC, December 1993. ACM. 



Report of the Working Group on Storage I/O for Large-Scale - Gibson, Vitter. Wilkes (1996) (3 citations) 
(Correct) 

... onto spare disks, and future accesses remapped to the new devices [32] Storage devices are typically 
burdened by long positioning times, and a virtual device can be used to dynamically remap the physical 
location associated with a logical block, thus reducing the current access latency [19, 23, 57, 58, 73]. 

Additionally, most modern disk drives perform dynamic request reordering, in some cases taking advantage of 
low level information available only inside the storage device to optimize the request sequencing [35, 63] Since 
there is no single redundant disk array organization that is optimal for .... 

de Jonge, W., Kaashoek, M. F., and Hsieh, W. C. The logical disk: A new approach to improving file systems. In 
Proc. of 14th ACM Symp. on Operating Systems Principles (December 1993). 



The HP AutoRAID hierarchical storage system - Wilkes, al. (1996) (116 citations) (Correct) 

....still further. Some of the schemes described in [Menon and Courtney 1993] are also used in the dual controller 
version of the HP AutoRAID array to handle controller failures. The Loge disk drive controller [English and 
Stepanov 1992] and its foliowons Mime [Chao et al. 1992] and Logical Disk [de Jonge et al. 1993], all used 
a scheme of keeping an indirection table to fixed sized blocks held on secondary storage. None of these 
supported multiple storage levels, and none was targeted at RAID arrays. Work on an Extended Function 
Controller at HP s disk divisions in the 1980s looked at several of these issues, .... 

.... 1993; Seltzer et al. 1995] and cleaning (garbage collection) policies for them [Blackwell et al. 1995; 
McNutt 1994; Mogi and Kitsuregawa 1994] There is a large literature on hierarchical storage systems and 
the many commercial products in this domain (for example [Chen 1973; Cohen et al. 1989; DEC 1993; 
Deshpandee and Bunt 1988; Epoch Systems Inc. 1988; Gelb 1989; Henderson and Poston 1989; Katz et al. 
1991; Miller 1991; Misra 1981; Sienknecht etal. 1994; Smith 1981] together with much of the proceedings of the 
IEEE Symposia on Mass Storage Systems) Most of this work has been concerned with .... 

DE JONGE, W., KAASHOEK, M. F., AND HSIEH, W. C. 1993. The Logical Disk: a new approach to improving file 
systems. In Proceedings of 14th ACM Symposium on Operating Systems Principles. ACM, New York, 15-28. 



A Comparison of OS Extension Technologies - Small, Seltzer (1996) (53 citations) (Correct ) 

....another region of the file. If the kernel uses heuristics (rather than application knowledge) to choose a read 
ahead policy, it can not cope with arbitrary application behavior. With the cooperation of the application, it can 
make more appropriate read ahead decisions. A Logical Disk facility (LD) [DEJON93] sits between the 
filesystem and the physical disk. The filesystem reads and writes logical blocks, and the LD maps the logical 
requests to locations on the physical disk. The LD can be used to transparently replicate data, by writing it in 
multiple places on the same disk or multiple disks, and .... 

....to support a log structured layer between a filesystem and the physical disk. The simulation accepts write 
requests for logical blocks and maintains the mapping between these logical blocks and the physical blocks onto 
which they are stored. As with the system implemented by de Jonge et al. [DEJON93], our simulation 
maintains all data structures in main memory. We simulate a 1GB physical disk with 4KB blocks and 64KB (16 
block) segments. Our simulation uses a stream of block write requests that are skewed so that 80 of the requests 
are for 20 of the blocks. Because our simulation does not .... 

de Jonge, W., Kaashoek, M. F., Hsieh, W., "The Logical Disk: A New Approach to Improving File Systems" 
Proceedings of the 14th SOSP, pp. 15-28, Asheville, NC (December 1993). 



PCD - Disk Caching Disk: A New Approach for Boosting I/O.. - Hu. Yang (1996) (6 citations) ( Correct) 

....approaches 80 . In addition, LFS needs to buffer a large amount of data for a relatively long period of time in 
order to write into disk later as a log, which may cause reliability problems. There are several other approaches 
such as log structured array [11] Loge [12] and Logical Disk approach [13]. The Logical Disk approach 
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improves the I O performance by working at the interface between the file system and the disk subsystem. It 
separates file management from disk management by using logical block numbers and block lists. Logical Disk 
hides the details of disk block organization from the .... 

W. de Jonge, M. F. Kaashoek, and W. C. Hsieh, "The logical disk: A new approach to improving file systems" in 
Proceedings of the 14th ACM Symposium on Operating Systems Principles, (Asheville, NC), pp. 15-28, Dec. 
1993. 



The HP Auto RAID hierarchical storage system - Wilkes, al. (1995) (116 citations) (C orrect ) 

....mirrored form, thereby improving performance still further. Some of the schemes described in [Menon93] are 
also used in the dual controller version of the HP AutoRAID array to handle controller failures. The Loge disk 
drive controller [English92] and its follow ons Mime [Chao92] and Logical Disk [deJonge93], all used a 
scheme of keeping an indirection table to fixed sized blocks held on secondary storage. None of these 
supported multiple storage levels, and none were targeted at RAID arrays. Work on an Extended Function 
Controller at HP s disk divisions in the 1980s looked at several of these .... 

Wiebren de Jonge, M. Frans Kaashoek, and Wilson C. Hsieh. The Logical Disk: a new approach to improving file 
systems. Proceedings of 14th ACM Symposium on Operating Systems Principles (Asheville, NC), pages 15-28, 
5-8 December 1993. 



Report of the Working Group on Storage I/O for Large-Scale .. - Gibson, Vitter, Wilkes (1996) (3 citations) 
(Correct) 

.... onto spare disks, and future accesses remapped to the new devices [32] Storage devices are typically 
burdened by long positioning times, and a virtual device can be used to dynamically remap the physical 
location associated with a logical block, thus reducing the current access latency [19, 23, 57, 58, 73]. 

Additionally, most modern disk drives perform dynamic request reordering, in some cases taking advantage of 
low level information available only inside the storage device to optimize the request sequencing [35, 63] Since 
there is no single redundant disk array organization that is optimal for .... 

de Jonge, W., Kaashoek, M. F., and Hsieh, W. C. The logical disk: A new approach to improving file systems. In 
Proc. of 14th ACM Symp. on Operating Systems Principles (December 1993). 



Petal: Distributed Virtual Disks - Lee, Thekkath (1996) (133 citations) ( Correct) 

....work related to Petal in terms of four primary characteristics: type of abstraction (block level or file systemlevel) 
degree of distribution, level of fault tolerance, and support for incremental expandability. Related block level 
storage systems include RAID II [7] TickerTAlP [5] Logical Disk [8], Loge [10] Mime [6] AutoRAID [19] and 
Swift [4] Some of these systems support only simple algorithmic mappings between the address space 
seen by a client and the underlying physical disks. This mapping is usually completely specified when the 
system is configured. In contrast, AutoRAID, .... 

Wiebren de Jonge, M. Frans Kaashoek, and Wilson C. Hsieh. The logical disk: A new approach to improving file 
systems. In Proceedings of the 14th ACM Symposium on Operating Systems Principles, pages 15-28, December 
1989. 



Opportunistic Log: Efficient Installation Reads in a Reliable .. - OToole. Shrira (1994) (1 1 citations) (Correct) 

....on the basis of disk head position, we can expect to dramatically reduce the expected cost of an installation 
read. There is substantial previous work on delayed processing of disk write operations [13, 25] Some 
methods applied to delayed disk writes involve writing pages at new locations [5, 8, 22] and would not 
work with disk reads. However, standard disk scheduling methods based on head position apply equally well to 
read operations. In particular, Seltzer et al. 25] have shown that when a pool of 1000 operations is available, 
greedy algorithms can reduce the cost of individual .... 

W. de Jonge, F. Kaashoek, and W. Hsieh. The Logical Disk: A New Approach to Improving File Systems. In Proc. 
of the 14th Symposium on Operating Systems Principles, Asheville, NC, December 1993. ACM. 
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Exodisk: Maximizing Application Control Over Storage Management - Grimm (1996) (1 citation) Self-citation 
(Kaashoek) (Correct ) 

....to set their own storage allocation policies and optimizations. This approach prevents the exodisk system 
from transparently reorganizing data on disk or avoiding block overwrite semantics, as is done in some 
logical representations of disk storage (for example, in Mime [7] or the Logical Disk [8]) But it also does 
not suffer from the overheads associated with such logical to physical mappings, and it gives 
applications the flexibility to utilize the policies that are best suited for the given application. Furthermore, 
if desired, logical representations of disk storage can be implemented .... 

....self protected extent, has its own protection information) Alternatively, it can use one exonode, storing the FAT 
in the exonode as application defined data and using fixed size extent entries for the individual blocks. Logical 
representations of disk storage such as Mime [7] or the Logical Disk [8] map logical block numbers to 
physical disk locations. Such a mapping can be efficiently implemented using an exonode: The logical block 
number can be an index into the array of exonode entries and the physical block information (again, a fixed size 
extent) is stored in that exonode entry 

[Article contains additional citation context not shown here] 

Wiebren de Jonge, M. Frans Kaashoek, and Wilson C. Hsieh. The logical disk: A new approach to improving file 
systems. In Proceedings of the 14th Symposium on Operating Systems Principles, pages 15-28, Ashville, North 
Carolina, December 1993. 



Design Considerations for the Symphony Integrated.. - Shenoy. Goyal, Rao. Vin (Correct) 
No context found. 

W. Jonge, M. F. Kaashoek, and W. C. Hsieh. The Logical Disk: A New Approach to Improving File Systems. In 
Proceedings of the Fourteenth Symposium on Operating Systems Principles, 1993. 



A Tool for Constructing Safe Extensible C++ Systems - Small (1998) (7 citations) (Correct) 
No context found. 

de Jonge, W., Kaashoek, M. F., Hsieh, W., "The Logical Disk: A New Approach to Improving File Systems" Proc. 
14th SOSP, Asheville, NC, 15-28, December 1 993. 



HFS: A flexible file system for shared-memory multiprocessors - Krieger (1994) (17 citations) (Correct) 
No context found. 

Wiebren de Jonge, M. Frans Kaashoek, and Wilson C. Hsieh. The logical disk: A new approach to improving file 
systems. In Proceedings of the Fourteenth ACM Symposium on Operating Systems Principles, pages 15-28, 
1993. 

First 50 documents 

Online articles have much greater impact More about CiteSeer.lST Add search form to your site Submit 
documents Feedback 

CiteSeer.lST - Copyright Penn State and NEC 



http://citeseer.ist.psu.edU/context/135725/0 



4/17/06 



